Tag
6 articles
This article explains what frontier AI is, why safety testing matters, and how government oversight can help protect people from potential AI risks.
The U.S. government has gained pre-release access to AI models from five major tech labs for national security testing, as cybersecurity threats rise and the tech race with China intensifies.
ZDNET has developed a comprehensive testing framework to evaluate the rapidly evolving AI ecosystem, combining automated and manual evaluation techniques for reliable assessments.
Learn what AI red teaming is, how it works, and why it is essential for building safe and fair AI systems. This beginner-friendly guide explains why testing AI models before deployment matters.
Galtea raises $3.2M to help enterprises test AI agents, addressing the gap between demo and production performance.
OpenAI's GPT-5.4 shows impressive capabilities but often fails to answer the specific questions asked, raising concerns about its practical utility in professional settings.